Safe Driving in Occluded Environments

Wang, Zhuoyuan, Jia, Tongyao, Rajborirug, Pharuj, Ramesh, Neeraj, Okuda, Hiroyuki, Suzuki, Tatsuya, Kar, Soummya, Nakahira, Yorie

arXiv.org Artificial Intelligence

Ensuring safe autonomous driving in the presence of occlusions poses a significant challenge for policy design. While existing model-driven control techniques based on set invariance can handle visible risks, occlusions create latent risks in which safety-critical states are not observable. Data-driven techniques also struggle with latent risks, because a direct mapping from risk-critical objects in sensor inputs to safe actions cannot be learned when those objects are not visible. Motivated by these challenges, we propose a probabilistic safety certificate for latent risk. Our key technical enabler is the application of probabilistic invariance: it relaxes the strict observability requirements of set-invariance methods, which demand knowledge of risk-critical states. The proposed technique yields linear action constraints that confine the latent-risk probability within a tolerance. Such constraints can be integrated into model predictive controllers or embedded in data-driven policies to mitigate latent risks. The method is tested in the CARLA simulator and compared with several existing techniques. Theoretical and empirical analyses jointly demonstrate that the proposed method assures long-term safety in real-time control in occluded environments, without being overly conservative and with transparency to exposed risks.
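The certificate's output, a linear constraint on the action, can be enforced by minimally correcting a nominal command. A minimal sketch, assuming a single halfspace constraint `a @ u <= b` (the actual certificate terms in the paper are derived from the latent-risk probability, not supplied here):

```python
import numpy as np

def safe_action(u_nom, a, b):
    """Project a nominal action onto the halfspace {u : a @ u <= b}.

    The linear form mirrors the paper's action constraints; the vector a
    and bound b are hypothetical placeholders for illustration.
    """
    violation = a @ u_nom - b
    if violation <= 0:
        return u_nom                         # nominal action already safe
    return u_nom - violation * a / (a @ a)   # minimal-norm correction

# Nominal command violating a @ u <= 1 gets pulled back onto the boundary.
u = safe_action(np.array([2.0, 0.0]), np.array([1.0, 0.0]), 1.0)  # -> [1.0, 0.0]
```

The closed-form projection keeps the filter cheap enough for real-time control; inside an MPC, the same inequality would simply be added to the constraint set.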


STREAM (ChemBio): A Standard for Transparently Reporting Evaluations in AI Model Reports

McCaslin, Tegan, Alaga, Jide, Nedungadi, Samira, Donoughe, Seth, Reed, Tom, Bommasani, Rishi, Painter, Chris, Righetti, Luca

arXiv.org Artificial Intelligence

Evaluations of dangerous AI capabilities are important for managing catastrophic risks. Public transparency into these evaluations - including what they test, how they are conducted, and how their results inform decisions - is crucial for building trust in AI development. We propose STREAM (A Standard for Transparently Reporting Evaluations in AI Model Reports), a standard to improve how model reports disclose evaluation results, initially focusing on chemical and biological (ChemBio) benchmarks. Developed in consultation with 23 experts across government, civil society, academia, and frontier AI companies, this standard is designed to (1) be a practical resource to help AI developers present evaluation results more clearly, and (2) help third parties identify whether model reports provide sufficient detail to assess the rigor of the ChemBio evaluations. We concretely demonstrate our proposed best practices with "gold standard" examples, and also provide a three-page reporting template to enable AI developers to implement our recommendations more easily.
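A reporting standard like this lends itself to mechanical completeness checks. A toy sketch, with field names that are illustrative stand-ins rather than STREAM's actual schema:

```python
# Check that a ChemBio evaluation entry in a model report carries enough
# detail for third-party review. REQUIRED_FIELDS is a hypothetical subset,
# not the standard's real template.
REQUIRED_FIELDS = {"benchmark", "score", "elicitation_method",
                   "threshold_rationale", "decision_link"}

def missing_fields(entry: dict) -> set:
    """Return which required reporting fields an entry omits."""
    return REQUIRED_FIELDS - entry.keys()

entry = {"benchmark": "wmdp-bio", "score": 0.62}
gaps = missing_fields(entry)   # three fields still unreported
```

Such a check captures only presence, not rigor; the standard's prose guidance and "gold standard" examples address the latter.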


Data Driven Diagnosis for Large Cyber-Physical-Systems with Minimal Prior Information

Steude, Henrik Sebastian, Diedrich, Alexander, Pill, Ingo, Moddemann, Lukas, Vranješ, Daniel, Niggemann, Oliver

arXiv.org Artificial Intelligence

Diagnostic processes for complex cyber-physical systems often require extensive prior knowledge in the form of detailed system models or comprehensive training data. However, obtaining such information poses a significant challenge. To address this issue, we present a new diagnostic approach that operates with minimal prior knowledge, requiring only a basic understanding of subsystem relationships and data from nominal operations. Our method combines a neural network-based symptom generator, which employs subsystem-level anomaly detection, with a new graph diagnosis algorithm that leverages minimal causal relationship information between subsystems, information that is typically available in practice. Our experiments with fully controllable simulated datasets show that our method includes the true causal component in its diagnosis set for 82% of all cases while effectively reducing the search space in 73% of the scenarios. Additional tests on the real-world Secure Water Treatment dataset showcase the approach's potential for practical scenarios. Our results thus highlight our approach's potential for practical applications with large and complex cyber-physical systems where limited prior knowledge is available.
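The graph step can be illustrated with a toy sketch: if anomalies are assumed to propagate along causal edges, an anomalous subsystem whose causal parents are all nominal is a root-cause candidate. This is a simplification for illustration, not the paper's actual algorithm:

```python
# Toy graph-based root-cause narrowing over subsystem-level anomaly flags.
def root_cause_candidates(edges, anomalous):
    """edges: iterable of (cause, effect) pairs between subsystems.
    anomalous: set of subsystems flagged by the symptom generator."""
    parents = {}
    for cause, effect in edges:
        parents.setdefault(effect, set()).add(cause)
    # Keep anomalous nodes with no anomalous causal parent.
    return {n for n in anomalous
            if not (parents.get(n, set()) & anomalous)}

# Chain A -> B -> C with B and C anomalous: the search space shrinks
# from {B, C} to the upstream-most anomaly {B}.
cands = root_cause_candidates([("A", "B"), ("B", "C")], {"B", "C"})
```

Even this crude rule shows why coarse cause-effect edges, which plants typically document, suffice to shrink the diagnosis set.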


Efficient Evaluation of Quantization-Effects in Neural Codecs

Mack, Wolfgang, Mustafa, Ahmed, Łaganowski, Rafał, Hijazy, Samer

arXiv.org Artificial Intelligence

Neural codecs, comprising an encoder, quantizer, and decoder, enable signal transmission at exceptionally low bitrates. Training these systems requires techniques like the straight-through estimator, soft-to-hard annealing, or statistical quantizer emulation to allow a non-zero gradient across the quantizer. Evaluating the effect of quantization in neural codecs, like the influence of gradient passing techniques on the whole system, is often costly and time-consuming due to training demands and the lack of affordable and reliable metrics. This paper proposes an efficient evaluation framework for neural codecs using simulated data with a defined number of bits and low-complexity neural encoders/decoders to emulate the non-linear behavior in larger networks. Our system is highly efficient in terms of training time and computational and hardware requirements, allowing us to uncover distinct behaviors in neural codecs. We propose a modification to stabilize training with the straight-through estimator based on our findings. We validate our findings against an internal neural audio codec and against the state-of-the-art descript-audio-codec.
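The straight-through estimator the abstract refers to can be sketched without an autograd framework: the forward pass applies a hard quantizer, while the backward pass pretends the quantizer is the identity so a non-zero gradient reaches the encoder. A minimal numpy illustration (the bit depth and grid are arbitrary choices, not the paper's setup):

```python
import numpy as np

def quantize_forward(x, n_bits=3):
    """Hard quantization to a 2**n_bits-step grid; gradient is zero a.e."""
    levels = 2 ** n_bits
    return np.round(x * levels) / levels

def quantize_backward(grad_out):
    """STE backward: pass the upstream gradient through unchanged."""
    return grad_out

x = np.array([0.11, 0.27])
y = quantize_forward(x)                 # values snapped to the 3-bit grid
g = quantize_backward(np.ones_like(x))  # gradient survives the quantizer
```

In a real codec the same trick is usually written as `x + stop_gradient(round(x) - x)`, which fuses both passes into one expression.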


Artificial Intelligence in Traffic Systems

Saxena, Ritwik Raj

arXiv.org Artificial Intelligence

Existing research on AI-based traffic management systems, utilizing techniques such as fuzzy logic, reinforcement learning, deep neural networks, and evolutionary algorithms, demonstrates the potential of AI to transform the traffic landscape. This article endeavors to review the topics where AI and traffic management intersect. It comprises areas like AI-powered traffic signal control systems, automatic distance and velocity recognition (for instance, in autonomous vehicles, hereafter AVs), smart parking systems, and Intelligent Traffic Management Systems (ITMS), which use data captured in real-time to keep track of traffic conditions, and traffic-related law enforcement and surveillance using AI. AI applications in traffic management cover a wide range of spheres. The spheres comprise, inter alia, streamlining traffic signal timings, predicting traffic bottlenecks in specific areas, detecting potential accidents and road hazards, managing incidents accurately, advancing public transportation systems, development of innovative driver assistance systems, and minimizing environmental impact through simplified routes and reduced emissions. The benefits of AI in traffic management are also diverse. They comprise improved management of traffic data, sounder route decision automation, easier and speedier identification and resolution of vehicular issues through monitoring the condition of individual vehicles, decreased traffic snarls and mishaps, superior resource utilization, alleviated stress of traffic management manpower, greater on-road safety, and better emergency response time.


Generative AI-based Pipeline Architecture for Increasing Training Efficiency in Intelligent Weed Control Systems

Modak, Sourav, Stein, Anthony

arXiv.org Artificial Intelligence

In automated crop protection tasks such as weed control, disease diagnosis, and pest monitoring, deep learning has demonstrated significant potential. However, these advanced models rely heavily on high-quality, diverse datasets, often limited and costly in agricultural settings. Traditional data augmentation can increase dataset volume but usually lacks the real-world variability needed for robust training. This study presents a new approach for generating synthetic images to improve deep learning-based object detection models for intelligent weed control. Our GenAI-based image generation pipeline integrates the Segment Anything Model (SAM) for zero-shot domain adaptation with a text-to-image Stable Diffusion Model, enabling the creation of synthetic images that capture diverse real-world conditions. We evaluate these synthetic datasets using lightweight YOLO models, measuring data efficiency with mAP50 and mAP50-95 scores across varying proportions of real and synthetic data. Notably, YOLO models trained on datasets with 10% synthetic and 90% real images generally demonstrate superior mAP50 and mAP50-95 scores compared to those trained solely on real images. This approach not only reduces dependence on extensive real-world datasets but also enhances predictive performance. The integration of this approach opens opportunities for achieving continual self-improvement of perception modules in intelligent technical systems.
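Building a training set with a fixed synthetic share, like the 10% synthetic / 90% real mix reported above, is a one-function job. A sketch whose sampling details (seeding, sizing) are our own choices, not the paper's pipeline:

```python
import random

def mix_datasets(real, synthetic, synth_fraction=0.1, seed=0):
    """Combine all real samples with enough synthetic ones so that the
    synthetic share of the final set equals synth_fraction."""
    rng = random.Random(seed)
    n_synth = round(len(real) * synth_fraction / (1.0 - synth_fraction))
    mixed = list(real) + rng.sample(list(synthetic), n_synth)
    rng.shuffle(mixed)
    return mixed

# 90 real images plus a 10% synthetic share -> 100 training samples.
train = mix_datasets([f"real_{i}" for i in range(90)],
                     [f"synth_{i}" for i in range(50)])
```

Sweeping `synth_fraction` and re-training YOLO at each setting is how the mAP50 / mAP50-95 curves in the study would be reproduced.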


Learning from Naturally Occurring Feedback

Don-Yehiya, Shachar, Choshen, Leshem, Abend, Omri

arXiv.org Artificial Intelligence

Human feedback data is a critical component in developing language models. However, collecting this feedback is costly and ultimately not scalable. We propose a scalable method for extracting feedback that users naturally include when interacting with chat models, and leveraging it for model training. We are further motivated by previous work that showed there are also qualitative advantages to using naturalistic (rather than auto-generated) feedback, such as fewer hallucinations and biases. We manually annotated conversation data to confirm the presence of naturally occurring feedback in a standard corpus, finding that as much as 30% of the chats include explicit feedback. We apply our method to over 1M conversations to obtain hundreds of thousands of feedback samples. Training with the extracted feedback shows significant performance improvements over baseline models, demonstrating the efficacy of our approach in enhancing model alignment to human preferences.
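The extraction idea can be sketched as pattern matching over user turns. The cue phrases below are illustrative stand-ins; the paper's taxonomy and extraction method are richer than a pair of regexes:

```python
import re

# Toy detector for explicit, naturally occurring feedback in a user turn.
POSITIVE = re.compile(r"\b(thanks|that works|perfect|exactly)\b", re.I)
NEGATIVE = re.compile(r"\b(that'?s wrong|not what I asked|doesn'?t work)\b", re.I)

def extract_feedback(user_turn: str):
    """Label a turn as positive/negative feedback, or None if neutral."""
    if NEGATIVE.search(user_turn):
        return "negative"
    if POSITIVE.search(user_turn):
        return "positive"
    return None

labels = [extract_feedback(t) for t in
          ["Thanks, that works!", "That's wrong, try again.", "What about dates?"]]
```

Applied over millions of logged conversations, even a low per-turn hit rate yields the "hundreds of thousands of feedback samples" scale the abstract describes.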


Design Principles for Falsifiable, Replicable and Reproducible Empirical ML Research

Vranješ, Daniel, Niggemann, Oliver

arXiv.org Artificial Intelligence

Empirical research plays a fundamental role in the machine learning domain. At the heart of impactful empirical research lies the development of clear research hypotheses, which then shape the design of experiments. The execution of experiments must be carried out with precision to ensure reliable results, followed by statistical analysis to interpret these outcomes. This process is key to either supporting or refuting initial hypotheses. Despite its importance, there is high variability in research practices across the machine learning community and no uniform understanding of quality criteria for empirical research. To address this gap, we propose a model for the empirical research process, accompanied by guidelines to uphold the validity of empirical research. By embracing these recommendations, the community can achieve greater consistency, enhanced reliability, and increased impact.


Knowledge Guided Semi-Supervised Learning for Quality Assessment of User Generated Videos

Mitra, Shankhanil, Soundararajan, Rajiv

arXiv.org Artificial Intelligence

Perceptual quality assessment of user generated content (UGC) videos is challenging due to the requirement of large scale human annotated videos for training. In this work, we address this challenge by first designing a self-supervised Spatio-Temporal Visual Quality Representation Learning (ST-VQRL) framework to generate robust quality aware features for videos. Then, we propose a dual-model based Semi Supervised Learning (SSL) method specifically designed for the Video Quality Assessment (SSL-VQA) task, through a novel knowledge transfer of quality predictions between the two models. Our SSL-VQA method uses the ST-VQRL backbone to produce robust performances across various VQA datasets including cross-database settings, despite being learned with limited human annotated videos.


Perceptual Quality Assessment of Face Video Compression: A Benchmark and An Effective Method

Li, Yixuan, Chen, Bolin, Chen, Baoliang, Wang, Meng, Wang, Shiqi, Lin, Weisi

arXiv.org Artificial Intelligence

Recent years have witnessed an exponential increase in the demand for face video compression, and the success of artificial intelligence has expanded the boundaries beyond traditional hybrid video coding. Generative coding approaches have been identified as promising alternatives with reasonable perceptual rate-distortion trade-offs, leveraging the statistical priors of face videos. However, the great diversity of distortion types in spatial and temporal domains, ranging from the traditional hybrid coding frameworks to generative models, presents grand challenges in compressed face video quality assessment (VQA), which plays a crucial role in the whole delivery chain for quality monitoring and optimization. In this paper, we introduce the large-scale Compressed Face Video Quality Assessment (CFVQA) database, which is the first attempt to systematically understand the perceptual quality and diversified compression distortions in face videos. The database contains 3,240 compressed face video clips at multiple compression levels, which are derived from 135 source videos with diversified content using six representative video codecs, including two traditional methods based on hybrid coding frameworks, two end-to-end methods, and two generative methods. The unique characteristics of CFVQA, including large scale, fine granularity, great content diversity, and cross-compression distortion types, make benchmarking of existing image quality assessment (IQA) and VQA methods feasible and practical. The results reveal the weakness of existing IQA and VQA models, which challenges real-world face video applications. In addition, a FAce VideO IntegeRity (FAVOR) index for face video compression was developed to measure the perceptual quality, considering the distinct content characteristics and temporal priors of the face videos. Experimental results exhibit its superior performance on the proposed CFVQA dataset.
Face video-based services have been growing exponentially, coinciding with the accelerated proliferation of mobile communication and online video content sharing platforms. Face video compression towards human vision, which is indispensable in compressing and delivering gigantic-scale face video data, inevitably introduces visual distortions. During the past decade, advancements in video compression technology have substantially benefited face video compression.
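Benchmarks like CFVQA typically score a quality model by rank correlation between its predictions and human mean opinion scores (MOS). A minimal Spearman correlation on made-up scores (no tie handling; the data here is illustrative, not from the database):

```python
def spearman(pred, mos):
    """Spearman rank correlation between predicted quality and MOS,
    assuming no tied values."""
    def ranks(xs):
        order = sorted(range(len(xs)), key=lambda i: xs[i])
        r = [0] * len(xs)
        for rank, i in enumerate(order):
            r[i] = rank
        return r
    rp, rm = ranks(pred), ranks(mos)
    n = len(pred)
    d2 = sum((a - b) ** 2 for a, b in zip(rp, rm))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

# Predictions that order the clips exactly as the human MOS does.
rho = spearman([0.2, 0.5, 0.9, 0.4], [1.8, 3.1, 4.6, 2.9])  # -> 1.0
```

Reporting this correlation per codec family is what exposes the cross-compression weaknesses of existing IQA/VQA models that the abstract mentions.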